TO APPEAR : SDAIR 1995 Generating Synthetic
نویسندگان
چکیده
In this paper we describe work on a system for modeling errors in the output of OCR systems. The project is motivated by the desire to evaluate the performance of various text analysis systems under varying, yet controlled conditions. We describe a set of symbol and page models which are used to degrade an ideal text by introducing errors which typically occur during scanning, decomposition and recognition of document images. A rst generation of the software is described which implements the page models and allows the use of transition probabilities , either extracted from real data or generated synthetically, to corrupt text.
منابع مشابه
Generating the synthetic CT (sCT) and synthetic MR (sMR: sT1w/sT2w) images of the brain using atlas based method
Introduction: Radiation therapy planning (RTP) is one of the clinical applications in which both CT scan and MRI are used. MR and CT images are applied to determine the target volume and calculation of dose distribution, respectively. In addition, using two imaging modalities increases the department workload and cost. In this study, an algorithm was presented to create synthet...
متن کاملGenerating Synthetic Computed Tomography and Synthetic Magnetic Resonance (sMR: sT1w/sT2w) Images of the Brain Using Atlas-Based Method
Introduction: Nowadays, magnetic resonance imaging (MRI) in combination with computed-tomography (CT) is increasingly being used in radiation therapy planning. MR and CT images are applied to determine the target volume and calculate dose distribution, respectively. Since the use of these two imaging modalities causes registration uncertainty and increases department w...
متن کاملGenerating Representative Synthetic Workloads An Unsolved Problem
Synthetic disk request traces are convenient and popular workloads for performance evaluation of storage subsystem designs and implementations. This paper develops an approach for validating synthetic disk request generators. Using this approach, commonly-used simplifying assumptions about workload characteristics (e.g., uniformly-distributed starting addresses and Poisson arrivals) are shown t...
متن کاملA Hybrid Learning Model of Abductive Reasoning
Multicausal abductive tasks appear to have deliberate and implicit components: people generate and modify explanations using a series of recognizable steps, but these steps appear to be guided by an implicit hypothesis evaluation process. This paper proposes a hybrid symbolic-connectionist learning architecture for multicausal abduction. The architecture tightly integrates a symbolic Soar model...
متن کاملReal-time Integration of Synthetic Computer Graphics into Live Video Scenes
In commercials and motion pictures, computer graphics is often used to achieve special effects, e.g., adding synthetic dinosaurs in “Jurassic Park”. This process usually implies special camera equipment and a careful and time consuming post processing of single frames. We consider a simplified scenario, where synthetic objects are added automatically to a live scene in real-time. Reference poin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995